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In this paper, we consider a k-nearest neighbor kernel type estimator when the 
random variables belong in a Riemannian manifolds. We study asymptotic properties 
such as the consistency and the asymptotic distribution. A simulation study is also 
consider to evaluate the performance of the proposal. Finally, to illustrate the potential 
applications of the proposed estimator, we analyzed two real example where two different 
manifolds are considered. 
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1 Introduction 

Let Xi , . . . , Xn be independent and identically distributed random variables taking values 
in IR'^ and having density function /. A class of estimators of / which has been widely 
studied since the work of Rosenblatt (1956) and Parzen (1962) has the form 



where K{u) is a bounded density on IR and h is a sequence of positive number such that 



If we apply this estimator to data coming from long tailed distributions, with a small 
enough h to appropriate for the central part of the distribution, a spurious noise appears 
in the tails. With a large value of h for correctly handling the tails, we can not see the 

details occurring in the main part of the distribution. To overcome these defects, adaptive 
kernel estimators were introduced. For instance, a conceptually similar estimator of f{x) 
was studied by Wagner (1975) who defined a general neighbor density estimators by 
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where Hn{x) is the distance between x and the fc-nearest neighbor of x among Xi, . . . , X„, 
and /c = /c„ is a sequence of non-random integers such that hm„_>oo kn = oo. Through these 
adaptive bandwidth , the estimation in the point x has the guarantee that to be calculated 
using at least k points of the sample. 

However, in many applications, the variables X take values on different spaces than 
IR^. Usually these spaces have a more complicated geometry than the Euclidean space and 
this has to be taken into account in the analysis of the data. For example if we study the 
distribution of the stars with luminosity in a given range it is naturally to think that the 
variables belong to a spherical cylinder {S"^ x IR) instead of IR'^. If we considerer a region 
of the planet M, then the direction and the velocity of the wind in this region are points 
in the tangent bundle of M, that is a manifold of dimension 4. Other examples could be 
found in image analysis, mechanics, geology and other fields. They include distributions 
on spheres. Lie groups, among others, see for example Joshi, et.at. (2007), Goh and Vidal 
(2008). For this reason, it is interesting to study an estimation procedure of the density 
function that take into account a more complex structure of the variables. 

Nonparametric kernel methods for estimating densities of spherical data have been stud- 
ied by Hall, et .al (1987) and Bai, et. al. (1988). Pelletier (2005) proposed a family of non- 
parametric estimators for the density function based on kernel weight when the variables 
are random object valued in a closed Riemannian manifold. The Pelletier's estimators is 
consistent with the kernel density estimators in the Euclidean case considered by Rosenblatt 
(1956) and Parzen (1962). 

As we comment above, the importance of local adaptive bandwidth is well known in 
nonparametric statistics and this is even more true with data taking values on complexity 
space. In this paper, we propose a k-nearest neighbor method when the data takes values 
on a Riemannian manifolds. The proposal combine the ideas of smoothing in Euclidean 
spaces with the estimators introduced in Pelletier (2005). 

This paper is organized as follows. Section [2] contains a brief summary of the neces- 
saries concepts of Riemannian geometry. In Section |2.1[ we introduce a k-nearest neighbor 
estimators on Riemannian manifolds. Uniform consistency of the estimators is derived in 
Section 3.1, while in Section 3.2 the asymptotic distribution is obtained under regular as- 
sumptions. Section |4] contains a Monte Carlo study designed to evaluate the proposed 
estimators. Finally, Section [5] presents two example using real data. Proofs are given in the 
Appendix. 



2 Preliminaries and the estimator 

Let {M,g) be a d— dimensional Riemannian manifold without boundary. We denote by dg 
the distance induced by the metric g. With Bs{p) we denote a normal ball with radius s 
centered at p. The injectivity radius of {M,g) is given by injgM = inf sup{s £ IR > : 

Bs{p) is a normal ball}. Is easy to see that a compact Riemannian manifold has strictly 
positive injectivity radius. For example, it is not difficult to see that the d-dimensional 
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sphere 5*^ endowed with the metric induced by the canonical metric go of R'^^^ has injec- 
tivity radius equal to vr. If A'^ is a proper submanifold of the same dimension than {M,g), 
then injg\^N = 0. The Euclidean space or the hyperbolic space have infinity injectivity 
radius. Moreover, a complete and simply connected Riemannian manifold with non positive 
sectional curvature has also this property. 

Throughout this paper, we will assume that (M, g) is a complete Riemannian manifold, 
i.e. (M, dg) is a complete metric space. Also we will consider that injgM is strictly positive. 
This restriction will be clear in the Section [2.11 when we define the estimator. For standard 
result on diff'erential and Riemannian geometry we refer to the reader to Boothby (1975), 
Besse (1978), Do Carmo (1988) and Gallot, Hulin and Lafontaine (2004). 

Let p G M, we denote with Op and TpM the null tangent vector and the tangent space 
of M at p. Let Bs{p) be a normal ball centered at p. Then -Bs(Op) = exp~^{Bs{p)) is an 
open neighborhood of Op in TpM and so it has a natural structure of differential manifold. 
We are going to consider the Riemannian metrics g' and 5" in ^^(Op), where g' = exp*{g) 
is the pullback of g by the exponential map and g" is the canonical metric induced by gp 
in Bs{Op). Let w £ Bs{Op), and {Uj-ip) be a chart of -Bs(Op) such that w £ U. We note 
by {d/d'tpi\w, ■ ■ ■ ,d/d'il>d\w} the tangent vectors induced by {U,ip). Consider the matricial 
function with entries are given by g' {{d/d-ipi\w) , {d/di/jj\w))- The volumes of the 

parallelepiped spanned by {{d / d'ipi\w) , ■ ■ ■ , (d/d'tpdlw)} with respect to the metrics g' and 
g" are given by | detg' , (d/diPjU)) \^/^ and | det^" {{d/diPi\^) , {d/di^jU)) 1^2 

respectively. The quotient between these two volumes is independent of the selected chart. 
So, given q £ Bs{p), ii w = expp^{q) G -Bs(Op) we can define the volume density function, 
Op{q), on {M,g) as 

"p\Q) = z z r/o 

\det g" {{d/d^PiQ , (d/d^jQ) \^^^ 

for any chart {iJ,'ip) of -Bs(Op) that contains w = exp~^[q). For instance, if we consider a 
normal coordinate system (f7, ip) induced by an orthonormal basis {fi, . . . , Vd} of TpM then 
Op{q) is the function of the volume element dvg in the local expression with respect to chart 
{U, ip) evaluated at q, i.e. 



det gq 
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where g^lg = -D„.(o)expp(dj(0)) with ai{t) = expp^{q) + tvi for q £ U. Note that the 
volume density function Op{q) is not defined for all the pairs p and q in M, but it is if 
dg{p,q) < injgM. 

We finish the section showing some examples of the density function: 

i) In the case of {IR'^, go) the density function is Op{q) = 1 for all {p, q) £ IR'^ x BR!^. 

ii) In the 2-dimensional sphere of radius i?, the volume density is 

\sen{dg{pi,p2)/R)\ , a a ( \ ^ 

dpAP2) = R ^ if P2 / Pi, -Pi and 0p^{pi) = 1. 

dg{pi,p2) 
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where dg induced is given by 

dg{pi,P2) = i?arccos( ). 

iii) In the case of the cyUnder of radius 1 Ci endowed with the metric induced by the 
canonical metric of IR^, 6pi{p2) = 1 for ah {pi,P2) £Ci x Ci, and the distance induced 
is given by dg{pi,p2) = d2((ri, si), (r2, S2)) if c?2((ri,si), (r2,S2)) < tt, where (^2 is the 
EucUdean distance of IR^ and pi = (cos(rj), sin(rj), Sj) for i = 1,2. 

See also, Besse (1978) and Pennec (2006) for a discussion on the volume density function. 



2.1 The estimator 

Consider a probability distribution with a density / on a d— dimensional Riemannian man- 
ifold {M,g). Let Xi, ■ ■ ■ ,Xn be i.i.d random object takes values on M with density /. A 
natural extension of the estimator proposed by Wagner (1975) in the context of a Rieman- 
nian manifold is to consider the following estimator 

f (n) = ^ V 1 ( dg(.P,Xj 
^"^^^ nHi{p)j^^ex,{p) \ Hn{p) 

where K : IR ^ IR is a non-negative function with compact support, 6p{q) denotes the 
volume density function on (M, g) and i/„ {p) is the distance dg between p and the k- 
nearest neighbor of p among Xi, . . . , X„, and k = kn is a sequence of non-random integers 
such that lim„_j.oo kn = 00. 

As we mention above, the volume density function is not defined for all p and q. There- 
fore, in order to guarantee the well definition of the estimator we consider a modification of 
the proposed estimator. Using the fact that the kernel K has compact support, we consider 
as bandwidth Cnip) = ^^i^{Hnip),injgM} instead of Hn{p). Thus, the kernel only consi- 
ders the points Xi such that dg{Xi,p) < Cnip) that are smaller than injgM and for these 
points, the volume density function is well defined. Hence, the k-nearest neighbor kernel 
type estimator is defined as follows, 

/»(.)^;4j|:^-(i|#). a) 

where Cn{p) = min{i?„(p), inj^M}. 

Remark [2TT|l. If{M,g) is a compact Riemannian manifold and its sectional curvature 



is not bigger than a > 0, then we know by the Lemma of Klingerberg (see Gallot, Hulin, 
Lafontaine (2004)) that injgM > min{TT/y/a,l/2} where / is the length of the shortest 
closed geodesic in {M,g). 
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3 Asymptotic results 



Denote by C^{U) the set of i times continuously differentiable functions from U to IR where 
U is an open set of M. We assume that the measure induced by the probabihty P and by 
X is absolutely continuous with respect to the Riemannian volume measure di'g, and we 
denote by / its density on M with respect to dfg. More precisely, let B{M) be the Borel 
(T— field of M (the cj— field generated by the class of open sets of M). The random variable 
X has a probability density function /, i.e. if x ^ ^3{M), P{X^^{x)) = S-yf^^g- 



3.1 Uniform Consistency 

We will consider the following set of assumptions in order to derive the strong consistency 
results of the estimate fn{p) defined in (llj). 



HI. Let Mq be a compact set on M such that: 

i) / is a bounded function such that infpgA-fo fip) = ^ > 0. 

ii) infp,ggA/o 6p{q) = B > 0. 

H2. For any open set Uq of Mq such that Mq C Uq, f is of class on Uq- 

H3. The sequence /c„ is such that A:„ — oo, ^ — >• and — >• c« as n — )■ oo. 

H4:. K : IR ^ IR is a bounded nonnegative Lipschitz function of order one, with compact 
support [0,1] satisfying: J^d i^(||u||)du = 1, J^d uK{\\u\\)du = and 
0</^d||ufi^(||u||)du<oo. 

H5. The kernel K{u) verifies K{uz) > K{z) for all u € (0, 1). 



Remark 3.1,1. The fact that 9p{p) = 1 for all p G M guarantees that HI ii) holds. 
The assumption H?, is usual when dealing with nearest neighbor and the assumption HA is 
standard when dealing with kernel estimators. 



Theorem 3.1,2. Assume that HI to Hb holds, then we have that 



sup \Uv)-f{v)\^^. 

peMo 



3.2 Asymptotic normality 

To derive the asymptotic distribution of the regression parameter estimates we will need 
two additional assumptions. We will denote with Vr the Euclidean ball of radius r centered 
at the origin and with A(Vr-) its Lebesgue measure. 

Hf). f{p) > 0, / S C'^{V) with V C M an open neighborhood of M and the second 
derivative of / is bounded. 
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H6. The sequence kn is such that A;„ — )-oo, /c^/"-— ^Oasn— )-oo and there exists < /3 < oo 
such that y/Kn^W+^ ^ /3 as n — )• oo. 

H7. The kernel verifies 

i) /Ki(||u||)||u||2(iu < OO as s ^ OO where Ki{u) = i^'(||u||)||u||. 

ii) \\u\\'^+^ K2{u) ^ as ||u|| ^ oo where K2{u) = K"(||u||) ||uf - Ki{u) 



Remark ^^1. Note that div{K{\\u\\)u) = K'(||u||)||u|| + d i^(||u||), then using the 
divergence Theorem, we get that / -ftr'(||u||) ||u||du = K(||u||)up^du— d /i^(||u||)du. 

Thus, the fact that K has compact support in [0, 1] imphes that / Ki{u)du = —d. 

On the other hand, note that V(i^(||u||)||u|p) = i^i(||u||)u + 2E:(||u||)u and by iJ4 we 
get that / Ki{\\\i\\)Md\i = 0. 

Theorem [3l2l 2. Assume HA to HI. Then we have that 

^fK{Up)-f{p))^M{b{p),a\p)) 

with 

|n=0 



hip) = _ . \ mA)n\ du Y: 



and 



aHp) = X{Vi)f{p) f K'{\\u\\)du 
JVi 

where u = (ui, . . . , Ud) and {Bh{p),ip) is any normal coordinate system. 

In order to derive the asymptotic distribution of fnip), we will study the asymptotic 
behavior of h'^/C,^{p) where /i^ = A;„/(n/(p)A(Vi)). Note that if we consider fpiv) = 



3.1 



^n/('^Cn(p)-^(1^i))) fn{p) is a Consistent estimator of f{p) (see the proof of Theorem 
The next Theorem states that this estimator is also asymptotically normally distributed as 
in the Euclidean case. 



Theorem 3.2,3. Assume HA to H6, and let = A;„/(n/(p)A(Vi)). Then we have that 



V 



iV(&i(p),l) 



with 



blip) 



2 \ 



T , Jy^ujduLiip) 



\f{p)KVi), 



6d+ 12 



+ 



fip)KVi 



where u = (ui, . . . , Ud), t is the scalar curvature of (M, g), i.e. the trace of the Ricci tensor, 



Li{p) = Yl 



i=l 



duiUi 



u=o dui u=o dui 



n=0 , 



and {Bh{p), ip) is any normal coordinate system. 
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4 Simulations 

This section contains the results of a simulation study designed to evaluate the performance 
of the estimator defined in the Section 12. 1[ We consider three models in two different 
Riemannian manifolds, the sphere and the cylinder endowed with the metric induced by 
the canonical metric of IR^. We performed 1000 replications of independent samples of size 
n = 200 according to the following models: 

Model 1 (in the sphere): The random variables Xi for 1 < i < n are i.i.d. Von 
Mises distribution VM{fi, k) i.e. 

UAX) = Qy^'/i/2(^)exp{/^XV}, 

with /X is the mean parameter, k > is the concentration parameter and /i/2('^) = 
(^) sinh(K) stands for the modified Bessel function. This model has many important 
applications, as described in |16j and [19]. We generate a random sample follows a 
Von Mises distribution with mean (0, 0, 1) and concentration parameter 3. 

Model 2 (in the sphere): We simulate i.i.d. random variables Zi for 1 < i < n 
following a multivariate normal distribution of dimension 3, with mean (0, 0, 0) and 
covariance matrix equals to the identity. We define Xi = for 1 < i < n, therefore 

W^i II 

the variables Xi follow an uniform distribution in the two dimensional sphere. 

Model 3 (in the cylinder): We consider random variables Xi = (yi,ti) taking 
values in the cylinder x M. We generated the model proposed by Mardia and 
Sutton (1978) where, 

y, = (cos(0i),sin(^^)) ~^M((-1,0),5) 

U\yi^ N{l + 2V5cos{ei),l). 

Some examples of variables with this distribution can be found in Mardia and Sutton 
(1978). 

In all cases, for smoothing procedure, the kernel was taken as the quadratic kernel 
K(t) = (15/16)(1 — t^)^/(|x| < 1). We have considered a grid of equidistant values of k 
between 5 and 150 of length 20. 

To study the performance of the estimators of the density function /, denoted by /„, we 
have considered the mean square error (MSE) and the median square error (MedSE), i.e, 

1 " ^ 

MSEif^) = -Y,[fniXi)-fiX,)f. 
MedSE(/„) = median |/„(X,) - f{Xi)\^ . 
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Figure 1 gives the values of the MSE and MedSE of /„ in the sphere model considering 
different numbers of neighbors, while Figure 2 shows the cylinder model. The simulation 
study confirms the good behavior of fc— nearest neighbor estimators, under the different 
models considered. In all cases, the estimators are stable under large numbers of neigh- 
bors. However, as expected, the estimators using a small number of neighbors have a poor 
behavior, because in the neighborhood of each point there is a small number of samples. 




a) b) 
Figure 1: The nonparametric density estimator using different numbers of neighbor, a) the Von 
Mises model and b) the uniform model. 




Figure 2: The nonparametric density estimator using different numbers of neighbor in the cylinder. 
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5 Real Example 



5.1 Paleomagnetic data 

It is well know the need for statistical analysis of paleomagnetic data. Since the work 
developed by Fisher (1953), the study of parametric families was considered a principal tools 
to analyze and quantify this type of data (see Cox and Doell (1960), Butler (1992) and Love 
and Constable (2003)). In particular, our proposal allows to explore the nature of directional 
dataset that include paleomagnetic data without make any parametric assumptions. 

In order to illustrate the k- nearest neighbor kernel type estimator on the two-dimensional 
sphere, we illustrate the estimator using a paleomagnetic data set studied by Fisher, Lewis, 
and Embleton (1987). The data set consist in n = 107 sites from specimens of Precambrian 
volcanos whit measurements of magnetic remanence. The data set contains two variables 
corresponding to the directional component on a longitude scale, and the directional com- 
ponent on a latitude scale. The original data set is available in the library sm of R statistical 
package. 

To calculate the estimators the volume density function and the geodesic distance were 
taken as in the Section [2] and we considered the quadratic kernel K{t) = (15/16)(1 — 
t^)^/(|x| < 1). In order to analyzed the sensitivity of the results with respect to the number 
of neighbors, we plot the estimator using different bandwidths. The results are shown in 
the Figure 3. 

The real data was plotted in blue and with a large radius in order to obtain a better 
visualization. The Equator line, the Greenwich meridian and a second meridian are in gray 
while the north and south pols are denoted with the capital letter N and S respectively. 
The levels of concentration of measurements of magnetic remanence can be found in yellow 
for high levels and in red for lowest density levels. Also, the levels of concentration of 
measurements of magnetic remanence was illustrated with relief on the sphere that allow 
to emphasize high density levels and the form of the density function. 

As in the Euclidean case large number of neighbors produce estimators with small 
variance but high bias, while small values produce more wiggly estimators. This fact shows 
the need of the implementation of a method to select the adequate bandwidth for this 
estimators. However, this require further careful investigation and are beyond the scope of 
this paper. 
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Figure 3: The nonparametric density estimator using different number of neighbors, a) k=75, 
b)k=50, c) k=25 and d) k=10. 

5.2 Meteorological data 

In this Section, we consider a real data set collected in the meteorological station "Agiiita de Perdiz" 
that is located in Viedma, province of Ri'o Negro, Argentine. The dataset consists in the wind 
direction and temperature during January 2011 and contains 1326 measurements that were registered 
with a frequency of thirty minutes. We note that the considered variables belong to a cylinder with 
radius 1. 

As in the previous Section, we consider the quadratic kernel and we took the density function 
and the geodesic distance as in Section [2j Figure 4 shows the result of the estimators, the color and 
form of the graphic was constructed as in the previous example. 

It is important to remark that the measurement devices of wind direction not present a sufficient 
precision to avoid repeated data. Therefore, we consider the proposal given in Garci'a-Portugues, 
et.al. (2011) to solve this problem. The proposal consists in perturb the repeated data as follows 
Ti = ri -\- S,£i, where denote the wind direction measurements and Si, for i = l,...,n were 
independently generated from a von Mises distribution with /i = (1,0) and k = 1. The selection of 
the perturbation scale ^ was taken ^ ~ n^^/^ as in Garcia-Portugues, et.al. (2011) where in this 
case n = 1326 
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The work of Garcia- Portugues, et.al. (2011) contains other nice real example where the proposed 
estimator can be apply. They considered a naive density estimator applied to wind directions and 
S02 concentrations, that allow you explore high levels of contamination. 




c) d) 

Figure 4: The nonparametric density estimator using different number of neighbors, a) k=75, 
b)k=150, c) k=300 and d) k=400. 

In Figure 4, we can see that the lowest temperature are more probable when the wind comes 
from the East direction. However, the highest temperature does not seem to have correlation with 
the wind direction. Also, note that in Figure 4, we can see two mode corresponding to the minimum 
and maximum daily of the temperature. 

These examples show the usefulness of the proposed estimator for the analysis and exploration 
of these type of dataset. 



Appendix 



Proof of Theorem l3J^2. 

Let 



no^ ^ Ox, I 



K 



dg{p,Xi 



Note that if 6„ = Sn{p) verifies 6in < 6n{p) < (52ji for all p G A/q where 6in and (52n satisfy Sin — > 

and — — — >■ oo as n — >■ oo for i = 1, 2 then by Theorem 3.2 in Henry and Rodriguez (2009) we have 
logn 

that 

sup |/„(p,<5„)-/(p)| ^0 (2) 

For each < /3 < 1 we define, 

where D-(,9) = P^/'^'^K, D+{P) = ji-^'^'^K and /i^ = fc„/(nA(Vi)/(p)) with A(Vi) denote the 
Lebesgue measure of the ball in M'^ with radius r centered at the origin. Note that 

sup - Pf{p)\ ^ and sup - rV(p)l ^ 0. (3) 

peMo peMo 

For all < /3 < 1 and £ > we define 



5-(/3,e) 


= {w 


: sup |/„ {p,f5)~ f{p)\ <e 






peMa 




= {w 


: sup \f+{p,l3)~f{p)\<e 






peMa 


Sn{s) 


= {w 


■■ sup \fn{p) - < e }, 










= {fn 


(p,/3)</„(p)</+b,/3)} 



Then, ^„(/3) n S-{l3,e) n c Snie). Let A = suppgj^j^ /(p). For < e < 3A/2 and 

= 1 — ^ consider the following sets 

G„(e) = {w : < C„(p) < D+{p,) for all p e Mq} 

G-(e) = { sup |/-(p,/3e) - /3s/(p)| < ^} 

G+(e) = { sup - < 

Then we have that G„(£) c A„{l3^), G-{e) c S'~(^e,£) and G+{e) c S'+(/3e,e). Therefore, G„(e) n 
G-(£)nG+(£)c5„{e). 
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On the other hand, using that limr^oV{Br{p))/r''-iJ.{Vi) = 1, where V{Br{p)) denotes the 
volume of the geodesic baU centered at p with radius r (see Gray and Vanhccke (1979)) and similar 
arguments those considered in Devroye and Wagner (1977), we get that 



sup 

peMo 



nX{V,)f{p)Hd{p) 



- 1 



Recall that injgM > and H^{p) 0. Then for straightforward calculations we obtained that 



"A(Vi)/(p)C,1(p) 



0. Thus, Ic^ie) ~^ and (3 1 imply that Is^{e) ~^ 0- □ 



Proof of Theorem 137212. 

A Taylor expansion of second order gives 



<{p)j^,OxM V Up) 



where 

Bn 

and 



{ n 



{p) \ h. 



kniiK/Cip)) - 1) < f{p) + 



-fip)}, 

{hn/Up))~'^]hi 1^ 1 (dg{p,X,, 



m/Up))-i]Up) nhij^^ex,{p) '\ Up) 

nr((hdird(„\\ i J(^"/c«(p))-i]' I ^ I ( dg{p,Xj) \ 
/c„((/.„/c„(p))-i)2[(,,/^,(^))_^j l^;— jt^"/'^"] 



with hf^ ~ fc„/n/(p)A(Vi) and min(/i„,C„) < ^„ < max(/i„,C„). Note that H6 implies that /i„ 
satisfies the necessary hypothesis given in Theorem 4.1 in Rodriguez and Henry (2009), in particular 



By the Theorem and the fact that hn/C,n{p) — > 1, we obtain that An converges to a normal 
distribution with mean b{p) and variance cr^(p)- Therefore it is enough to show that _B„ and C„ 
converges to zero in probability. 

Note that '"n^J fld'^^-!}~} d^^ and by similar arguments those considered in Theorem 3.1 in 



Ci^/CKp))-! 



Pelletier (2005) and Remark 3.2 1. we get that 



dg{p,Xj 



1 " 1 

^S^^^'V Up) 



A I Xi(u)du/(p) = -d/(p). 



Therefore, by Theorem |3.2[ 3., we obtain that i?„ — ^ 0. As ^„//i„ converges to one in probability, 
in order to concluded the proof, it remains to prove that 

n 

-vif-^ E ^r^l^2 (dg(p,x,)/e„) I 
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is bounded in probability. 

By H7, there exits r > such that \t\''+^\K2it)\ < 1 if \t\ > r. Let Cr = i-r,r), then we have 



that 



1 ^ 1 



\ ^ SUP|,|<,|X2W| " 1 / 



dg{P,Xj) 



1 " 1 

— — V / 



dg{p,Xj 



dg{p,Xj 



As min(/i„, Cn(p)) < < niax(ft,„, („(p)) = ^„ it follows that 
1 ^ 1 



<iiip) 



,=1 '^M 



, dg{p,Xj] 



in 



< 



< 



sup 

\t\<r 



Cn(p)/ |t|<r nhf^ f-{0X,{P) V 

1 " 1 



dgiP,Xj) 

C'n{p)ir{dxApy^''\ Up) 



1 V 1 , 

= + C„2 + ^,13 + C„4. 



dg{p,Xj 



dg{p,Xj) 



dg{P,Xj] 

Up) 



-id+1) 



in 

Up) 



in 
(d+l) 



(d+1) 



By similar arguments those considered in Theorem 3.1 in PcUctier (2005), we have that C„i 
f{p)JlcAs)ds and C„3 ^ /(p) //c=(s)|s|-('^+i)ds. 

Finally, let Af^ = {(1 - £)/i„ < Cn < (1 + for < e < 1. Then for n large enough 

P(A^) > 1 — £ and in Af^ we have that 



< . / dg{Xj,p) 
- ^'V(l+e)/^n 



dg{Xj,p) 

Up) 



Cn(p) 



V(i 



dg{Xj,p) 



(1 - 



^(d+i) 



Up) 



(1 - e)/in 



(d+l) 



This fact and analogous arguments those considered in Theorem 3.1 in Pellcticr (2005), allow to 
conclude the proof.n 



Proof of Theorem 137213. 

Denote 6„ = ^5^/(1 + zkn^^^), then 

Pi^/KihfjC - 1) < ^) = ^(C' > b„) = P{H^ > b„, injgM^ > 6„). 
As 6„ — > and injgM > 0, there exists uq such that for all n > uq we have that 

P{Hf^ > 6„, tnjgM'' > bn) = P{Hf^ > hn). 
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Let Zi such that Zi = l when dg{p,Xi) < bl/'^ and Zi = elsewhere. Thus, we have that P{H^ > 

bn) = P{J27=i — ^n)- Let Qn = P{dg{jp,Xi) < bl/'^). Note that qn ^ and nqn oo as n ^ oo, 
therefore 



)) < i — {kn - nqn) I . 



Using the Lindeberg Central Limit Theorem we easily obtain that (ng„) ^^^X^"=i(-^i — E(Zi)) 
is asymptotically normal with mean zero and variance one. Hence, it is enough to show that 

(ng„)"^^^(fc„ - nqn) z + bi{p). 

Denote by /U„ = n (/(?) ~ f{p))dvg{q). Note that /x„ = n qn — Wn with Wn = 

n f{p)V{B^u4p)). Thus, 

1 / \ 1/2 / \ 1/2 



:{kn - nq„) = ' {kn - Wn) ( ^ + 1/2 \ 

Let {B^i/d{p),%p) be a coordinate normal system. Then, we note that 

— ^ / f{q)dvg{q) = I f o r\n)6^ ° r\n)dn. 

The Lebesgue's Differentiation Theorem and the fact that — — r )■ 1 imply that — 0. 

A(V^i/<i) Wn 

On the other hand, from Gray and Vanhecke (1979), we have that 



Hence, we obtain that 



,n-l/2('l. -y,) - Wn ' knZkn ' Wn ' rbj"^ kn ,,,-1/2 i„ n/h'^/d) 



+ Bn + Cn- 

1/2 



It's easy to see that An ^ z and bi!'^ kn = ''{f(p)x(Vi))-y'' (^vWu^(p))^ ' ^^^^^ 

obtain that Bn ^ t /3('^+*)/''/(6d + 12) (/(p)m(Vi))"^/''. A similar argument shows that C„ 
and therefore we get that Wn^^^{kn — Wn) z + 6rf+i2 (/W"^(^i))~'^^^- 



In order to concluded the proof wo will show that — — )■ — — — - — , . , / m? du Li(p). 

Wn'^ (/(p)A(Vi))^<*+2)/<' Jvi 

use a second Taylor expansion that leads to, 

/ {f{q)-f{p))di^A'l) = E^^^ir-U=oe'/'/ Ui9,oi;-\bi/'^u)du 



i,J = l 

+ 0(6j,+3/rf). 
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Using again a Taylor expansion on 6pOtp ^(•) at we have that 

/ if{<l)-f{p))di^,{q) = bl+'/' [ uldnL,(p) + 0{b'+-'"') 

JB^i,i(p) JVi 

and by H6 the theorem follows.D 
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